Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Adv Sci (Weinh) ; : e2205445, 2023 Jun 02.
Article in English | MEDLINE | ID: covidwho-20244847

ABSTRACT

The spread of coronavirus disease 2019 (COVID-19), caused by severe respiratory syndrome coronavirus 2 (SARS-CoV-2), has progressed into a global pandemic. To date, thousands of genetic variants have been identified among SARS-CoV-2 isolates collected from patients. Sequence analysis reveals that the codon adaptation index (CAI) values of viral sequences have decreased over time but with occasional fluctuations. Through evolution modeling, it is found that this phenomenon may result from the virus's mutation preference during transmission. Using dual-luciferase assays, it is further discovered that the deoptimization of codons in the viral sequence may weaken protein expression during virus evolution, indicating that codon usage may play an important role in virus fitness. Finally, given the importance of codon usage in protein expression and particularly for mRNA vaccines, it is designed several codon-optimized Omicron BA.2.12.1, BA.4/5, and XBB.1.5 spike mRNA vaccine candidates and experimentally validated their high levels of expression. This study highlights the importance of codon usage in virus evolution and provides guidelines for codon optimization in mRNA and DNA vaccine development.

2.
Virol J ; 20(1): 31, 2023 02 17.
Article in English | MEDLINE | ID: covidwho-2259752

ABSTRACT

BACKGROUND: Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses. METHODS: Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence. RESULTS: We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant. CONCLUSIONS: Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.


Subject(s)
COVID-19 , RNA Viruses , Humans , SARS-CoV-2/genetics , Nucleotides , COVID-19/genetics , Codon , Mutation , Genome, Viral , RNA Viruses/genetics , Evolution, Molecular
3.
PeerJ ; 10: e13562, 2022.
Article in English | MEDLINE | ID: covidwho-1903863

ABSTRACT

The ongoing prevailing COVID-19 pandemic caused by SARS-CoV-2 is becoming one of the major global health concerns worldwide. The SARS-CoV-2 genome encodes spike (S) glycoprotein that plays a very crucial role in viral entry into the host cell via binding of its receptor binding domain (RBD) to the host angiotensin converting enzyme 2 (ACE2) receptor. The continuously evolving SARS-CoV-2 genome results in more severe and transmissible variants characterized by the emergence of novel mutations called 'variants of concern' (VOC). The currently designated alpha, beta, gamma, delta and omicron VOC are the focus of this study due to their high transmissibility, increased virulence, and concerns for decreased effectiveness of the available vaccines. In VOC, the spike (S) gene and other non-structural protein mutations may affect the efficacies of the approved COVID-19 vaccines. To understand the diversity of SARS-CoV-2, several studies have been performed on a limited number of sequences. However, only a few studies have focused on codon usage bias (CUBs) pattern analysis of all the VOC strains. Therefore, to evaluate the evolutionary divergence of all VOC S-genes, we performed CUBs analysis on 300,354 sequences to understand the evolutionary relationship with its adaptation in different hosts, i.e., humans, bats, and pangolins. Base composition and RSCU analysis revealed the presence of 20 preferred AU-ended and 10 under-preferred GC-ended codons. In addition, CpG was found to be depleted, which may be attributable to the adaptive response by viruses to escape from the host defense process. Moreover, the ENC values revealed a higher bias in codon usage in the VOC S-gene. Further, the neutrality plot analysis demonstrated that S-genes analyzed in this study are under 83.93% influence of natural selection, suggesting its pivotal role in shaping the CUBs. The CUBs pattern of S-genes was found to be very similar among all the VOC strains. Interestingly, we observed that VOC strains followed a trend of antagonistic codon usage with respect to the human host. The identified CUBs divergence would help to understand the virus evolution and its host adaptation, thus help design novel vaccine strategies against the emerging VOC strains. To the best of our knowledge, this is the first report for identifying the evolution of CUBs pattern in all the currently identified VOC.

4.
Viruses ; 12(5)2020 04 30.
Article in English | MEDLINE | ID: covidwho-1726009

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which first occurred in Wuhan (China) in December of 2019, causes a severe acute respiratory illness with a high mortality rate, and has spread around the world. To gain an understanding of the evolution of the newly emerging SARS-CoV-2, we herein analyzed the codon usage pattern of SARS-CoV-2. For this purpose, we compared the codon usage of SARS-CoV-2 with that of other viruses belonging to the subfamily of Orthocoronavirinae. We found that SARS-CoV-2 has a high AU content that strongly influences its codon usage, which appears to be better adapted to the human host. We also studied the evolutionary pressures that influence the codon usage of five conserved coronavirus genes encoding the viral replicase, spike, envelope, membrane and nucleocapsid proteins. We found different patterns of both mutational bias and natural selection that affect the codon usage of these genes. Moreover, we show here that the two integral membrane proteins (matrix and envelope) tend to evolve slowly by accumulating nucleotide mutations on their corresponding genes. Conversely, genes encoding nucleocapsid (N), viral replicase and spike proteins (S), although they are regarded as are important targets for the development of vaccines and antiviral drugs, tend to evolve faster in comparison to the two genes mentioned above. Overall, our results suggest that the higher divergence observed for the latter three genes could represent a significant barrier in the development of antiviral therapeutics against SARS-CoV-2.


Subject(s)
Betacoronavirus/genetics , Codon , Coronavirus/genetics , Genome, Viral , Base Composition , Betacoronavirus/chemistry , Betacoronavirus/physiology , Biological Evolution , Coronavirus/classification , Genes, Viral , Host Specificity , Mutation , Phylogeny , SARS-CoV-2
5.
Virology ; 568: 56-71, 2022 03.
Article in English | MEDLINE | ID: covidwho-1665518

ABSTRACT

SARS-CoV-2, the seventh coronavirus known to infect humans, can cause severe life-threatening respiratory pathologies. To better understand SARS-CoV-2 evolution, genome-wide analyses have been made, including the general characterization of its codons usage profile. Here we present a bioinformatic analysis of the evolution of SARS-CoV-2 codon usage over time using complete genomes collected since December 2019. Our results show that SARS-CoV-2 codon usage pattern is antagonistic to, and it is getting farther away from that of the human host. Further, a selection of deoptimized codons over time, which was accompanied by a decrease in both the codon adaptation index and the effective number of codons, was observed. All together, these findings suggest that SARS-CoV-2 could be evolving, at least from the perspective of the synonymous codon usage, to become less pathogenic.


Subject(s)
COVID-19/epidemiology , COVID-19/virology , Codon Usage , Codon , Evolution, Molecular , Pandemics , SARS-CoV-2/genetics , Betacoronavirus/classification , Betacoronavirus/genetics , Gene Expression Regulation, Viral , Genome, Viral , Genomics/methods , Humans , Open Reading Frames , Organ Specificity , Phylogeny
6.
Viruses ; 13(9)2021 09 10.
Article in English | MEDLINE | ID: covidwho-1411076

ABSTRACT

The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.


Subject(s)
COVID-19/epidemiology , COVID-19/virology , Codon Usage , Genome, Viral , Mutation , SARS-CoV-2/genetics , Selection, Genetic , Evolution, Molecular , Host-Pathogen Interactions/genetics , Humans , Pandemics
7.
Biomolecules ; 11(6)2021 06 18.
Article in English | MEDLINE | ID: covidwho-1273388

ABSTRACT

The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons' A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome's level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.


Subject(s)
COVID-19/virology , Codon Usage , Genome, Viral , SARS-CoV-2/genetics , Evolution, Molecular , Humans , Mutation , Phylogeny
8.
J Mol Evol ; 89(6): 341-356, 2021 07.
Article in English | MEDLINE | ID: covidwho-1227833

ABSTRACT

Severe Acute Respiratory Syndrome Coronavirus-2 is a zoonotic virus with a possible origin in bats and potential transmission to humans through an intermediate host. When zoonotic viruses jump to a new host, they undergo both mutational and natural selective pressures that result in non-synonymous and synonymous adaptive changes, necessary for efficient replication and rapid spread of diseases in new host species. The nucleotide composition and codon usage pattern of SARS-CoV-2 indicate the presence of a highly conserved, gene-specific codon usage bias. The codon usage pattern of SARS-CoV-2 is mostly antagonistic to human and bat codon usage. SARS-CoV-2 codon usage bias is mainly shaped by the natural selection, while mutational pressure plays a minor role. The time-series analysis of SARS-CoV-2 genome indicates that the virus is slowly evolving. Virus isolates from later stages of the outbreak have more biased codon usage and nucleotide composition than virus isolates from early stages of the outbreak.


Subject(s)
COVID-19/epidemiology , COVID-19/virology , Codon Usage/genetics , Evolution, Molecular , Host-Pathogen Interactions/genetics , SARS-CoV-2/genetics , SARS-CoV-2/physiology , Adaptation, Physiological/genetics , Animals , COVID-19/transmission , Chiroptera/genetics , Genome, Viral/genetics , Humans , Mutation , Pandemics , Principal Component Analysis , Selection, Genetic/genetics , Time Factors , Virus Replication
9.
J Biomed Inform ; 118: 103801, 2021 06.
Article in English | MEDLINE | ID: covidwho-1219153

ABSTRACT

Understanding the molecular mechanism of COVID-19 pathogenesis helps in the rapid therapeutic target identification. Usually, viral protein targets host proteins in an organized fashion. The expression of any viral gene depends mostly on the host translational machinery. Recent studies report the great significance of codon usage biases in establishing host-viral protein-protein interactions (PPI). Exploring the codon usage patterns between a pair of co-evolved host and viral proteins may present novel insight into the host-viral protein interactomes during disease pathogenesis. Leveraging the similarity in codon usage patterns, we propose a computational scheme to recreate the host-viral protein-protein interaction network. We use host proteins from seventeen (17) essential signaling pathways for our current work towards understanding the possible targeting mechanism of SARS-CoV-2 proteins. We infer both negatively and positively interacting edges in the network. Further, extensive analysis is performed to understand the host PPI network topologically and the attacking behavior of the viral proteins. Our study reveals that viral proteins mostly utilize codons, rare in the targeted host proteins (negatively correlated interaction). Among them, non-structural proteins, NSP3 and structural protein, Spike (S), are the most influential proteins in interacting with multiple host proteins. While ranking the most affected pathways, MAPK pathways observe to be the worst affected during the SARS-CoV-2 infection. Several proteins participating in multiple pathways are highly central in host PPI and mostly targeted by multiple viral proteins. We observe many potential targets (host proteins) from the affected pathways associated with the various drug molecules, including Arsenic trioxide, Dexamethasone, Hydroxychloroquine, Ritonavir, and Interferon beta, which are either under clinical trial or in use during COVID-19.


Subject(s)
COVID-19 , Codon Usage , Host-Pathogen Interactions , Protein Interaction Maps , Signal Transduction , COVID-19/diagnosis , COVID-19/therapy , Humans
10.
Gene Rep ; 23: 101055, 2021 Jun.
Article in English | MEDLINE | ID: covidwho-1086939

ABSTRACT

The novel corona virus disease or COVID-19 caused by a positive strand RNA virus (PRV) called SARS-CoV-2 is plaguing the entire planet as we conduct this study. In this study a multifaceted analysis was carried out employing dinucleotide signature, codon usage and codon context to compare and unravel the genomic as well as genic characteristics of the SARS-CoV-2 isolates and how they compare to other PRVs which represents some of the most pathogenic human viruses. The main emphasis of this study was to comprehend the codon biology of the SARS-CoV-2 in the backdrop of the other PRVs like Poliovirus, Japanese encephalitis virus, Hepatitis C virus, Norovirus, Rubella virus, Semliki Forest virus, Zika virus, Dengue virus, Human rhinoviruses and the Betacoronaviruses since codon usage pattern along with the nucleotide composition prevalent within the viral genome helps to understand the biology and evolution of viruses. Our results suggest discrete genomic dinucleotide signature within the PRVs. Some of the genes from the different SARS-CoV-2 isolates were also found to demonstrate heterogeneity in terms of their dinucleotide signature. The SARS-CoV-2 isolates also demonstrated a codon context trend characteristically dissimilar to the other PRVs. The findings of this study are expected to contribute to the developing global knowledge base in countering COVID-19.

11.
Infect Genet Evol ; 89: 104736, 2021 04.
Article in English | MEDLINE | ID: covidwho-1051860

ABSTRACT

Since 2002, the world has witnessed major outbreaks of acute respiratory illness by three zoonotic coronaviruses (CoVs), which differ from each other in pathogenicity. Reasons for the lower pathogenicity of SARS-CoV-2 than the other two zoonotic coronaviruses, SARS-CoV and MERS-CoV, are not well understood. We herein compared the codon usage patterns of the three zoonotic CoVs causing severe acute respiratory syndromes and four human-specific CoVs (NL63, 229E, OC43, and HKU1) causing mild diseases. We found that the seven viruses have different codon usages, with SARS-CoV-2 having the lowest effective number of codons (ENC) among the zoonotic CoVs. Human codon adaptation index (CAI) analysis revealed that the CAI value of SARS-CoV-2 is the lowest among the zoonotic CoVs. The ENC and CAI values of SARS-CoV-2 were more similar to those of the less-pathogenic human-specific CoVs. To further investigate adaptive evolution within SARS-CoV-2, we examined codon usage patterns in 3573 genomes of SARS-CoV-2 collected over the initial 4 months of the pandemic. We showed that the ENC values and the CAI values of SARS-CoV-2 were decreasing over the period. The low ENC and CAI values could be responsible for the lower pathogenicity of SARS-CoV-2. While mutational pressure appears to shape codon adaptation in the overall genomes of SARS-CoV-2 and other zoonotic CoVs, the E gene of SARS-CoV-2, which has the highest codon usage bias, appears to be under strong natural selection. Data from the study contribute to our understanding of the pathogenicity and evolution of SARS-CoV-2 in humans.


Subject(s)
Adaptation, Physiological/genetics , Codon , SARS-CoV-2/genetics , Zoonoses/genetics , Animals , COVID-19/virology , Evolution, Molecular , Genome, Viral , Humans , Phylogeny , SARS-CoV-2/physiology , Species Specificity
12.
Int J Mol Sci ; 21(21)2020 Oct 24.
Article in English | MEDLINE | ID: covidwho-895369

ABSTRACT

Transmissible gastroenteritis virus (TGEV) is a coronavirus associated with diarrhea and high mortality in piglets. To gain insight into the evolution and adaptation of TGEV, a comprehensive analysis of phylogeny and codon usage bias was performed. The phylogenetic analyses of maximum likelihood and Bayesian inference displayed two distinct genotypes: genotypes I and II, and genotype I was classified into subtypes Ia and Ib. The compositional properties revealed that the coding sequence contained a higher number of A/U nucleotides than G/C nucleotides, and that the synonymous codon third position was A/U-enriched. The principal component analysis based on the values of relative synonymous codon usage (RSCU) showed the genotype-specific codon usage patterns. The effective number of codons (ENC) indicated moderate codon usage bias in the TGEV genome. Dinucleotide analysis showed that CpA and UpG were over-represented and CpG was under-represented in the coding sequence of the TGEV genome. The analyses of Parity Rule 2 plot, ENC-plot, and neutrality plot displayed that natural selection was the dominant evolutionary driving force in shaping codon usage preference in genotypes Ia and II. In addition, natural selection played a major role, while mutation pressure had a minor role in driving the codon usage bias in genotype Ib. The codon adaptation index (CAI), relative codon deoptimization index (RCDI), and similarity index (SiD) analyses suggested that genotype I might be more adaptive to pigs than genotype II. Current findings contribute to understanding the evolution and adaptation of TGEV.


Subject(s)
Codon Usage , Evolution, Molecular , Transmissible gastroenteritis virus/genetics , CpG Islands , Genome, Viral , Selection, Genetic
13.
Front Cell Dev Biol ; 8: 831, 2020.
Article in English | MEDLINE | ID: covidwho-769194

ABSTRACT

Severe acute respiratory syndrome has spread quickly throughout the world and was declared a pandemic by the World Health Organization (WHO). The pathogenic agent is a new coronavirus (SARS-CoV-2) that infects pulmonary cells with great effectiveness. In this study we focus on the codon composition for the viral protein synthesis and its relationship with the protein synthesis of the host. Our analysis reveals that SARS-CoV-2 preferred codons have poor representation of G or C nucleotides in the third position, a characteristic which could result in an unbalance in the tRNAs pools of the infected cells with serious implications in host protein synthesis. By integrating this observation with proteomic data from infected cells, we observe a reduced translation rate of host proteins associated with highly expressed genes and that they share the codon usage bias of the virus. The functional analysis of these genes suggests that this mechanism of epistasis can contribute to understanding how this virus evades the immune response and the etiology of some deleterious collateral effect as a result of the viral replication. In this manner, our finding contributes to the understanding of the SARS-CoV-2 pathogeny and could be useful for the design of a vaccine based on the live attenuated strategy.

14.
J Med Virol ; 92(4): 433-440, 2020 04.
Article in English | MEDLINE | ID: covidwho-2245

ABSTRACT

The current outbreak of viral pneumonia in the city of Wuhan, China, was caused by a novel coronavirus designated 2019-nCoV by the World Health Organization, as determined by sequencing the viral RNA genome. Many initial patients were exposed to wildlife animals at the Huanan seafood wholesale market, where poultry, snake, bats, and other farm animals were also sold. To investigate possible virus reservoir, we have carried out comprehensive sequence analysis and comparison in conjunction with relative synonymous codon usage (RSCU) bias among different animal species based on the 2019-nCoV sequence. Results obtained from our analyses suggest that the 2019-nCoV may appear to be a recombinant virus between the bat coronavirus and an origin-unknown coronavirus. The recombination may occurred within the viral spike glycoprotein, which recognizes a cell surface receptor. Additionally, our findings suggest that 2019-nCoV has most similar genetic information with bat coronovirus and most similar codon usage bias with snake. Taken together, our results suggest that homologous recombination may occur and contribute to the 2019-nCoV cross-species transmission.


Subject(s)
Betacoronavirus/genetics , Chiroptera/virology , Coronavirus Infections/transmission , Coronavirus Infections/virology , Disease Reservoirs , Pneumonia, Viral/transmission , Pneumonia, Viral/virology , Snakes/virology , Spike Glycoprotein, Coronavirus/genetics , Animals , Betacoronavirus/classification , Betacoronavirus/physiology , Bungarus/genetics , Bungarus/virology , COVID-19 , Chiroptera/genetics , Codon Usage , Coronavirus Infections/epidemiology , Disease Outbreaks , Evolution, Molecular , Genome, Viral , Homologous Recombination , Host Specificity , Humans , Naja naja/genetics , Naja naja/virology , Phylogeny , Pneumonia, Viral/epidemiology , SARS-CoV-2 , Snakes/genetics , Zoonoses
SELECTION OF CITATIONS
SEARCH DETAIL